
10 Criterion Validity Examples

Dave Cornell (PhD)

Dr. Cornell has worked in education for more than 20 years. His work has involved designing teacher certification for Trinity College in London and in-service training for state governments in the United States. He has trained kindergarten teachers in 8 countries and helped businessmen and women open baby centers and kindergartens in 3 countries.


Chris Drew (PhD)

This article was peer-reviewed and edited by Chris Drew (PhD). The review process on Helpful Professor involves having a PhD level expert fact check, edit, and contribute to articles. Reviewers ensure all content reflects expert academic consensus and is backed up with reference to academic studies. Dr. Drew has published over 20 academic articles in scholarly journals. He is the former editor of the Journal of Learning Development in Higher Education and holds a PhD in Education from ACU.

Criterion validity is a type of validity that examines whether scores on one test are predictive of performance on another.

For example, if employees take an IQ test, the boss would like to know if this test predicts actual job performance.

If an IQ test does predict job performance, then it has criterion validity.
If an IQ test does not predict job performance, then it does not have criterion validity.

To make that determination, a correlation is calculated between the IQ scores and a measure of job performance.

The higher the value of the correlation, the stronger the relation between the two and the higher the criterion validity.

When the criterion is measured at a later point in time, this is also called predictive validity.

Of course, there are other factors related to job performance, so the correlation will never be perfect (i.e., 1). In most situations, there will be many factors associated with a particular performance outcome, and in some cases, hundreds.
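To make the correlation step concrete, here is a minimal sketch of how a criterion validity coefficient could be computed. The IQ scores and supervisor ratings are made-up numbers used purely for illustration, and the function name `pearson_r` is our own; it simply implements the standard Pearson correlation.

```python
from math import sqrt

def pearson_r(x, y):
    """Pearson correlation between two equal-length lists of numbers."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two equal-length samples with at least 2 values")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    # Sum of cross-products and sums of squares around the means
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined when a variable is constant")
    return cov / sqrt(var_x * var_y)

# Hypothetical data: IQ scores (predictor) and supervisor ratings
# on a 1-10 scale (criterion) for eight employees.
iq_scores = [95, 100, 105, 110, 115, 120, 125, 130]
job_ratings = [5, 6, 5, 7, 6, 8, 7, 9]

validity_coefficient = pearson_r(iq_scores, job_ratings)
print(f"criterion validity coefficient r = {validity_coefficient:.2f}")
```

A value close to 1 would suggest the test tracks the criterion closely, while a value near 0 would suggest it tells the boss very little about job performance.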

Examples of Criterion Validity

1. Leadership Inventories and Leadership Skills

A leadership inventory can predict whether someone will be good in a leadership role.

Predictor Variable: High score on a leadership inventory Criterion Variable: Aptitude for a leadership role

It takes a long time to know which employees have leadership potential. They have to be seen in various situations over a period of years to develop a solid understanding of their personality and ability to handle pressure.

That is very inefficient, especially for a new company that is expanding rapidly.

This is where personality inventories come into play. By administering a test that assesses leadership traits, a company can obtain a lot of data about a large number of employees very rapidly.

The key issue is to make sure that the test administered has criterion validity. As long as it does, it will be able to predict, with some degree of accuracy, which employees are suitable for a leadership role.


2. The SAT and College GPA

Studies have found that SATs have a weak to moderate ability to predict your college GPA.

Predictor Variable: SAT Score Criterion Variable: College GPA

There have been many studies on the criterion validity of SAT scores in predicting college GPAs (Kobrin et al., 2008).

The basic premise is that the SAT has criterion validity regarding college performance. The typical study involves obtaining the SAT scores of hundreds, even thousands, of students and then correlating those scores with first-year or final-year GPAs.

Although it is difficult to make a sweeping statement that adequately covers so many studies, the results range from finding weak to moderately strong associations between SAT and GPA.

A moderately strong association is more impressive than it sounds. It’s just one score on one test, but it predicts future performance on a criterion fairly well. If researchers were to include other factors, such as motivation and time-management skills, their predictions of a student’s college GPA would become more accurate.

3. The Housing Market

Predictor variables, including the number of new homes purchased, building permits awarded, interest rates on mortgages, and the employment rate, have high criterion validity in predicting the prices of houses.

Predictor Variables: Building permits issued, interest rates on mortgages, employment rate Criterion Variable: Housing prices

The housing market is a classic indicator of economic performance. The volume of sales each quarter is affected by numerous factors, including the employment rate, interest rates, building supply, and consumer confidence, just to name a few.

Each one of those factors can be measured and correlated with the housing market. Some factors have strong criterion validity, while others may have moderate or low criterion validity. However, when economists put them all together, the ability to predict the housing market improves significantly.

Of course, it’s still not an exact science, so there will always be some margin of error in those forecasts.
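To illustrate why putting predictors together helps, here is a small sketch using the standard formula for the multiple correlation of one criterion with two predictors. The three correlation values are hypothetical and not taken from any housing study; the function name is our own.

```python
from math import sqrt

def multiple_r_two_predictors(r_y1, r_y2, r_12):
    """Multiple correlation R of a criterion with two predictors combined.

    r_y1, r_y2: correlation of each predictor with the criterion.
    r_12: correlation between the two predictors.
    """
    if abs(r_12) >= 1:
        raise ValueError("predictors must not be perfectly correlated")
    r_squared = (r_y1**2 + r_y2**2 - 2 * r_y1 * r_y2 * r_12) / (1 - r_12**2)
    return sqrt(r_squared)

# Hypothetical correlations (assumptions for illustration only):
r_rates = -0.50    # mortgage interest rates vs. house prices
r_jobs = 0.40      # employment rate vs. house prices
r_between = -0.20  # interest rates vs. employment rate

combined = multiple_r_two_predictors(r_rates, r_jobs, r_between)
print(f"best single predictor: |r| = {max(abs(r_rates), abs(r_jobs)):.2f}")
print(f"both predictors combined: R = {combined:.2f}")
```

With these made-up inputs the combined R comes out higher than either predictor alone, which is the pattern the housing example describes.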

4. Psychological Correlates of Academic Performance

Self-efficacy and effort management have been found to have high criterion validity for academic performance because they are predictors of a high GPA.

Predictor Variable: Self-efficacy and effort management Criterion Variable: A high GPA

Richardson et al. (2012) examined a large number of studies between 1997 and 2010 that involved identifying psychological variables associated with academic performance. The researchers included over 7,000 studies and identified over 80 distinct variables that correlated with GPA.

Each of those 80 variables has a degree of criterion validity. That is, a student’s score on each one of those variables is predictive of grades to some extent. The real question is: which ones are the best predictors?

After conducting some very thorough analyses, the results indicated that psychological factors such as self-efficacy and effort management were the strongest correlates of GPA. In other words, student self-efficacy and effort management have criterion validity regarding GPA.

5. The In-Basket Activity

The in-basket job simulation test examines a manager’s ability to prioritize tasks. It asks a job applicant to sort through the items in an in-basket and decide the order in which to do them.

Predictor Variable: Performance in the in-basket exercise Criterion Variable: Applicant’s aptitude as a manager

The In-Basket activity is a job simulation task that is designed to assess an applicant’s ability to prioritize.

First, the applicant is seated at an official-looking desk and instructed to sort through the in-basket documents. The basket contains memos, email printouts, messages, and descriptions of various tasks that the company needs completed.

The applicant is given a short period of time to read the assorted documents and arrange them in order of priority.

This is an example of the type of assessment tool that an HR department will implement because they believe it has criterion validity. Performance in this activity is predictive of the ability to prioritize competing demands on the job.

6. Criterion Validity and Life-Expectancy

A predictor variable such as frequent exercise has criterion validity for life expectancy if it reliably correlates with how long people actually live.

Predictor Variable: Regular exercise Criterion Variable: A long life.

It seems like every month another study on life-expectancy is published.

Many of the studies have similar methodologies; at stage 1, thousands of people are assessed on a multitude of factors, including dietary habits, frequency of exercise, and psychological factors such as social support and personality characteristics.

At stage 2, approximately 20-50 years later, the researchers gather data on physical health such as cardiovascular disease and cancer.

By examining the correlations between the factors assessed at stage 1 and the health status of participants at stage 2, the researchers can determine which factors have criterion validity. That is, which factors at stage 1 are related to health at stage 2.

7. The NFL Combine

The NFL Combine is an annual test of college football players’ aptitude to play in the NFL. Most of these tests don’t have criterion validity, but the sprint test for running backs does predict future performance in the NFL.

Predictor Variable: NFL combine sprint test Criterion Variable: Running back performance in the NFL

Every year, top college football players are invited to participate in the NFL’s combine. The event lasts several days and involves each athlete going through a wide range of physical challenges, such as running the 40-yard dash, jumping as high as they can, and taking an interesting IQ test called the Wonderlic.

Head coaches, scouts, and owners put a lot of faith in the results of these tests, but no one is really sure why. As research by Kuzmits & Adams (2008) has revealed, there is “…no consistent statistical relationship between combine tests and professional football performance, with the notable exception of sprint tests for running backs” (p. 1721).

Apart from that one exception, the NFL combine may be one of the most enduring sets of tests that lack criterion validity.

8. Bus Driver Course Performance and Bus Accidents

To test the criterion validity of a driving course, researchers would have to follow up on a large group of hired drivers to see whether those who scored well on the course were involved in fewer accidents.

Predictor Variable: Score on a standardized bus driving course Criterion Variable: Number of bus accidents on the job

Hiring skilled and cautious bus drivers is a paramount concern for many municipalities. A single accident can result in numerous injuries. Add in the duration of driving times and the number of buses operating at any given time, and the situation is ripe for frequent accidents.

Therefore, bus companies need to select their drivers carefully. One component of the hiring process involves applicants driving through a standardized course. The course has been designed to mimic several characteristics found in real driving conditions and each applicant’s performance can be objectively measured and scored.

When that score is then correlated with actual driving records of hired drivers over the next few years, its criterion validity can be assessed.

Hopefully, the bus company will discover that the driving course has criterion validity. In other words, performance on the course can predict actual job performance. If so, applicants who do poorly on the course should not be hired.

9. Job Simulation and Nursing Competence

Evaluations of competence sometimes have low criterion validity. For example, in one study, ratings of nursing competence by external experts did not correlate with the evaluations made by the nurses’ day-to-day supervisors, suggesting that either the experts or the supervisors were conducting assessments with low criterion validity.

Predictor Variable: Assessments of performance by supervisors Criterion Variable: Actual on-the-job performance

Nursing is an incredibly high-pressure, high-stakes occupation. Poor job performance can result in serious injury or worse. Therefore, the ability to develop accurate measures of performance that have criterion validity is of substantial importance.

Unfortunately, relying on a paper and pencil measurement of skills fails to replicate the high-stress situations that nurses often find themselves facing.

However, “Evaluation of clinical performance in authentic settings is possible using realistic simulations that do not place patients at risk” (Hinton et al., 2017, p. 432).

In the Hinton et al. study, nurses engaged in specific medical-surgical test scenarios with manikins in a high-fidelity laboratory while being observed by experienced professionals. Those ratings were then compared to their supervisor’s ratings on the job.

In this example, the researchers were attempting to establish the criterion validity of the simulation scenarios to predict on-the-job performance. Despite all the effort that went into this study, scores on the simulated scenarios “… were not well correlated with self-assessment and supervisor assessment surveys” (p. 455).

10. Wearable Trackers and Steps Walked

Step-counting apps on your phone apparently have high criterion validity. To test this, Adamakis (2021) had people walk and jog on a treadmill, counted their steps, and then compared those counts with the results from the apps. The apps did pretty well!

Predictor Variable: Steps recorded on a step counter Criterion Variable: Actual steps walked

Ever wonder if those activity trackers on your phone are accurate? Well, research by Adamakis (2021) may shed some light on this question.

In this study, thirty adults wore two smartphones (one Android and one iOS), while running four apps: Runtastic Pedometer, Accupedo, Pacer, and Argus. They walked and jogged on a treadmill at three different speeds for 5 minutes. Two research assistants counted every step they took with a digital counter.

Criterion validity of the apps was then assessed by comparing the data from the apps with the step counts from the research assistants’ digital counters, which served as the criterion measure. The results revealed that “The primary finding regarding step count was that all freeware accelerometer-based apps were valid…when comparing iOS and Android apps, Android apps performed slightly more accurately than iOS ones” (p. 9).

So, it seems that these apps have acceptable criterion validity, at least when it comes to counting steps.
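One simple way to express how closely an app agrees with a criterion count is the mean absolute percentage error (MAPE). The sketch below is only an illustration of that idea; the step counts are invented, and it is not a reproduction of the analysis used in the study.

```python
def mean_absolute_percentage_error(measured, criterion):
    """Average absolute error of `measured`, as a percentage of `criterion`."""
    if len(measured) != len(criterion) or not measured:
        raise ValueError("need two non-empty, equal-length lists")
    if any(c == 0 for c in criterion):
        raise ValueError("criterion values must be non-zero")
    errors = [abs(m - c) / c for m, c in zip(measured, criterion)]
    return 100 * sum(errors) / len(errors)

# Hypothetical step counts for four treadmill sessions.
app_steps = [498, 605, 512, 590]      # reported by a step-counting app
counted_steps = [500, 600, 520, 600]  # counted by hand (the criterion)

mape = mean_absolute_percentage_error(app_steps, counted_steps)
print(f"MAPE = {mape:.2f}%")
```

The lower the percentage, the closer the app is to the criterion. What counts as "close enough" depends on the purpose of the measurement.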

This study was also a good example of concurrent validity because the validity of one test was established by conducting it concurrently (i.e., at the same time) with another measure known to be valid, to see whether they produce the same results.

With the prevalence of tests used to determine who gets into college or who gets hired as a bus driver, it would be nice to know if those tests are accurate. That is, is a person’s score on a given test at all related to actual performance, either at school or on the job?

As it turns out, there is a way to make this determination, and it’s called criterion validity. The usual methodology involves administering the test to a group of people and then assessing their performance in a given domain at a later date. That later date could be a matter of months or several years.

Fortunately, researchers have conducted a great many studies examining the criterion validity of thousands of tests. Tests that lack support are usually dropped or modified, while tests that are supported by research can be used in many practical situations.

Adamakis, M. (2021). Criterion validity of iOS and Android applications to measure steps and distance in adults. Technologies, 9, 55. https://doi.org/10.3390/technologies9030055

Cohen, R. J., & Swerdlik, M. E. (2005). Psychological testing and assessment: An introduction to tests and measurement (6th ed.). New York: McGraw-Hill.

Hinton, J., Mays, M., Hagler, D., Randolph, P., Brooks, R., DeFalco, N., Kastenbaum, B., & Miller, K. (2017). Testing nursing competence: Validity and reliability of the nursing performance profile. Journal of Nursing Measurement, 25(3), 431. https://doi.org/10.1891/1061-3749.25.3.431

Kobrin, J. L., Patterson, B. F., Shaw, E. J., Mattern, K. D., & Barbuti, S. M. (2008). Validity of the SAT for predicting first-year college grade point average (College Board Research Report No. 2008-5). New York, NY: College Board.

Richardson, M., Abraham, C., & Bond, R. (2012). Psychological correlates of university students’ academic performance: A systematic review and meta-analysis. Psychological Bulletin, 138(2), 353.


Criterion Validity | Definition, Types & Examples

Introduction


Criterion validity is a key concept in research methodology, ensuring that a quantitative test or measurement accurately reflects the intended outcome. This form of validity is essential for evaluating the effectiveness of various assessments and tools across different fields.

By comparing a new measure to outcomes from an established test, researchers can determine how well the new measure predicts or correlates with the criterion outcome. Understanding criterion validity is crucial for designing a new valid measure and conducting robust quantitative research.

This article outlines the definition, types, and examples of criterion validity, providing a clear and concise guide for researchers and students alike.

What is criterion validity?

Criterion validity refers to the extent to which a measurement or test accurately predicts or correlates with an outcome based on an established criterion.

It is a critical aspect of evaluating the effectiveness of a new tool or assessment method. The primary goal of criterion validity is to determine whether the results of a new measure align with the outcomes of an established, already validated measure.

There are two main ways to assess criterion validity: concurrent validity and predictive validity. Concurrent validity examines the correlation between the new measure and an established measure taken at the same time. This type of validity is useful when researchers need to validate a new test quickly, using existing, reliable data.

Predictive validity, on the other hand, assesses how well the new measure predicts future outcomes. This approach is particularly valuable in fields like psychology, education, and health, where predicting future performance or behavior is crucial.

For instance, in educational testing, a new reading comprehension test may be evaluated for criterion validity by comparing its results with those of an established test known to be reliable. If the new test's scores closely match the established test's scores, it demonstrates high concurrent validity. Alternatively, if the new test can accurately predict students' future academic performance, it exhibits high predictive validity.

Establishing criterion validity is essential for ensuring that a measurement tool is both accurate and reliable. Without it, the results of a study or assessment may be questionable, leading to incorrect conclusions or ineffective interventions.

Therefore, understanding and applying criterion validity is fundamental in research to develop robust and credible measurement instruments.

Why is criterion validity important?

Criterion validity is crucial because it ensures the accuracy and reliability of measurement tools used in research. By confirming that a new measure accurately reflects or predicts an established criterion, researchers can trust the results and conclusions drawn from their studies.

One primary reason criterion validity is important is that it helps verify the effectiveness of new assessment tools. For instance, in educational settings, developing a new test for measuring students' mathematical abilities requires validation against a trusted, established test. If the new test demonstrates high criterion validity, educators can confidently use it to assess and improve students' skills.

In clinical psychology, criterion validity is vital for diagnosing and predicting outcomes. A new diagnostic tool for depression, for example, must be validated against existing, reliable diagnostic criteria.

High criterion validity ensures that the new tool accurately identifies individuals with depression and predicts their future mental health outcomes. This validation is essential for providing effective treatment and support.

Criterion validity also plays a significant role in employment and organizational settings. For example, when developing a new job performance assessment, it is important to validate it against current performance measures.

High criterion validity indicates that the new assessment accurately predicts job performance, aiding in effective hiring and employee development decisions.

Moreover, criterion validity contributes to the persuasiveness of research findings. When a measure demonstrates strong criterion validity, researchers can apply the findings across different contexts and populations with greater confidence. This broader applicability enhances the impact and utility of research.

Types of criterion validity

Criterion-related validity is divided into several types, each serving a distinct purpose in evaluating how well a new measure correlates with or predicts an established outcome.

The two primary types of criterion validity are concurrent validity and predictive validity. Convergent validity and discriminant validity are closely related and are covered here as well, although they are more commonly classed under construct validity. Each type provides a different perspective on the effectiveness and reliability of a measurement tool.

Concurrent validity

Concurrent criterion validity assesses the extent to which a new measure correlates with an established measure taken at the same time. This type of validity is particularly useful when researchers need to validate a new tool quickly using existing, reliable data.

By comparing the results of the new measure with those of a well-established criterion measure administered concurrently, researchers can determine if the new measure produces similar outcomes.

For example, in the context of educational testing, if a new reading comprehension test is administered alongside a well-validated reading test, and the scores from both tests show a strong correlation, the new test is said to have high concurrent validity. This indicates that the new test is effective in measuring the same construct as the established test.

Predictive validity

Predictive criterion validity evaluates how well a new measure predicts future outcomes based on an established criterion. This type of validity is essential in fields where forecasting future performance or behavior is critical, such as psychology, education, and healthcare.

By demonstrating predictive validity, researchers can show that their new measure is not only reliable in the present but also useful for making accurate predictions about future events or behaviors.

For instance, a new aptitude test designed to predict students' success in college might be validated by comparing its scores to students' future academic performance. If the test scores accurately predict how well students will perform in their college courses, the test exhibits high predictive validity. This type of validity is crucial for creating tools that aid in long-term planning and decision-making.

Convergent validity

Convergent validity, which is often discussed alongside criterion validity, examines whether a measure correlates well with other measures of the same construct. High convergent validity indicates that the new measure is consistent with other established measures that assess the same concept.

This type of validity is essential for ensuring that different tools intended to measure the same construct produce similar results, thereby supporting the robustness and credibility of the new measure.

For example, if a new scale for measuring anxiety levels shows a high correlation with existing, validated anxiety scales, it demonstrates high convergent validity. This consistency across different measures provides confidence that the new scale is accurately assessing anxiety.

Discriminant validity

Discriminant validity, which is also often discussed alongside criterion validity, evaluates whether a measure is uncorrelated with measures of different constructs. High discriminant validity ensures that the new measure is distinct and not merely reflecting other unrelated constructs.

This type of validity is important for establishing the uniqueness of a new measure and confirming that it is not inadvertently assessing something else.

For instance, if a new test designed to measure depression shows low correlation with measures of unrelated constructs like intelligence or physical health, it has high discriminant validity. This demonstrates that the new test specifically assesses depression without being confounded by other, unrelated variables.
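The contrast between convergent and discriminant evidence can be sketched with two correlations computed on the same participants. All scores below are invented for illustration, and the helper `pearson_r` is our own minimal implementation of the Pearson correlation.

```python
from math import sqrt

def pearson_r(x, y):
    """Pearson correlation between two equal-length lists of numbers."""
    mean_x, mean_y = sum(x) / len(x), sum(y) / len(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    ss_x = sum((a - mean_x) ** 2 for a in x)
    ss_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(ss_x * ss_y)

# Hypothetical scores for six participants.
new_depression_scale = [4, 9, 12, 15, 20, 24]
established_depression_scale = [5, 8, 13, 14, 22, 25]  # same construct
intelligence_test = [108, 95, 112, 99, 104, 101]       # unrelated construct

convergent_r = pearson_r(new_depression_scale, established_depression_scale)
discriminant_r = pearson_r(new_depression_scale, intelligence_test)

print(f"convergent evidence:   r = {convergent_r:.2f}")
print(f"discriminant evidence: r = {discriminant_r:.2f}")
```

In this made-up data set the new scale correlates strongly with the established depression scale and only weakly with the intelligence test, which is the pattern a researcher would hope to see.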


Other types of validity

In addition to criterion validity, several other types of validity are essential for evaluating the accuracy and reliability of measurement tools. These include construct validity, face validity, and content validity. Each type of validity addresses different aspects of how well a test or instrument measures the intended construct.

Construct validity

Construct validity refers to the extent to which a test or instrument accurately measures the theoretical construct it is intended to measure. It involves both convergent and discriminant validity, as it assesses whether the test correlates well with other measures of the same construct (convergent validity) and does not correlate with measures of different constructs that should not be related (discriminant validity). Establishing construct validity is crucial for ensuring that the test truly reflects the underlying theoretical concept.

For example, a new intelligence test should demonstrate construct validity by correlating well with other established intelligence tests (convergent validity) and not correlating with unrelated constructs like personality traits (discriminant validity). High construct validity ensures that the test is a meaningful measure of intelligence.

Face validity

Face validity is the extent to which a test appears to measure what it claims to measure, based on subjective judgment. Unlike other forms of validity, face validity does not involve statistical analysis but rather relies on the assessment of experts or stakeholders.

Although face validity does not rely on statistical validation, it is important for ensuring that the test is acceptable to those who use or take it.

For instance, a customer satisfaction survey should have items that clearly relate to customer experiences and perceptions. If the survey items are straightforward and relevant, the survey is said to have high face validity. While not a statistically based measure, face validity helps ensure that the test is perceived as relevant and appropriate by its users.

Content validity

Content validity assesses whether a test adequately covers the entire range of the construct it aims to measure.

This type of validity involves a thorough examination of the test items to ensure they represent all aspects of the construct. Content validity is particularly important in educational and psychological testing, where comprehensive coverage of the subject matter is essential.

For example, a math proficiency test should include items that cover all relevant areas of mathematics, such as algebra, geometry, and arithmetic. High content validity ensures that the test provides a complete and accurate assessment of the construct.

Establishing criterion validity

Establishing criterion validity requires demonstrating that a new measurement tool or test accurately reflects or predicts an outcome based on an established criterion. This process calls for rigorous methods to ensure the new measure is both reliable and valid.

The following subsections outline the key steps in establishing criterion validity.

Selecting appropriate criteria

The first step in establishing criterion validity is to select appropriate criteria against which the new measure will be evaluated. The chosen criteria should be well-established, reliable, and relevant to the construct being measured.

For example, if a new test is designed to assess academic performance, the criterion measure could be students' grades or scores from a widely accepted standardized test. The selected criterion should accurately reflect the construct to ensure a meaningful comparison.

Criterion validity testing

Criterion validity testing involves comparing the new measure with the chosen criterion measure to evaluate their relationship. This process typically includes conducting correlational studies to determine the strength and direction of the relationship between the two measures.

A high correlation indicates that the new measure has strong criterion validity, suggesting it accurately reflects the criterion measure.

For example, to test the criterion validity of a new depression scale, researchers might administer both the new scale and an established depression inventory to the same group of participants.

By analyzing the correlation between the scores from both measures, researchers can assess the criterion validity of the new scale. Statistical techniques, such as Pearson's correlation coefficient, are commonly used to quantify the strength of the relationship.
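For reference, Pearson's correlation coefficient can be written as follows. The notation is our own choice: x_i is a participant's score on the new measure, y_i is the same participant's score on the criterion measure, and n is the number of participants.

```latex
r_{xy} = \frac{\sum_{i=1}^{n} (x_i - \bar{x})(y_i - \bar{y})}
              {\sqrt{\sum_{i=1}^{n} (x_i - \bar{x})^2}\;
               \sqrt{\sum_{i=1}^{n} (y_i - \bar{y})^2}}
```

The coefficient ranges from -1 to 1, with values near 0 indicating little or no linear relationship between the two measures.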

Addressing potential challenges

Establishing criterion validity can present several challenges, such as finding suitable criterion measures and accounting for external factors that may influence the results. Researchers must carefully select criterion measures that are not only relevant but also free from biases and errors.

Additionally, external factors, such as participants' varying levels of motivation or environmental influences, can affect the accuracy of the criterion validity assessment.

To address these challenges, researchers should conduct thorough pilot testing and use multiple criterion measures when possible. Employing a range of statistical techniques can also help to control for external factors and provide a more robust evaluation of criterion validity.
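One statistical technique for controlling an external factor is the partial correlation, which estimates the relationship between the new measure and the criterion after the linear influence of a third variable is removed from both. The sketch below uses the standard first-order formula; the correlation values and the choice of motivation as the external factor are assumptions made for illustration.

```python
from math import sqrt

def partial_correlation(r_xy, r_xz, r_yz):
    """Correlation of x and y with the linear effect of z removed from both."""
    denominator = sqrt((1 - r_xz**2) * (1 - r_yz**2))
    if denominator == 0:
        raise ValueError("z must not be perfectly correlated with x or y")
    return (r_xy - r_xz * r_yz) / denominator

# Hypothetical correlations:
r_test_criterion = 0.60        # new test vs. criterion measure
r_test_motivation = 0.50       # new test vs. participant motivation
r_criterion_motivation = 0.40  # criterion measure vs. motivation

adjusted = partial_correlation(
    r_test_criterion, r_test_motivation, r_criterion_motivation
)
print(f"raw r = {r_test_criterion:.2f}, controlling for motivation r = {adjusted:.2f}")
```

If the adjusted value drops well below the raw correlation, part of the apparent criterion validity may be due to the external factor rather than to the measure itself.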


What is criterion validity?

Last updated

7 February 2023

Reviewed by

Cathy Heath

Criterion validity or concrete validity refers to a method of testing the correlation of a variable to a concrete outcome. Higher education institutions and employers use criterion validity testing to model an applicant's potential performance. Some organizations also use it to model retention rates.

Before moving on, let's differentiate between norm-referenced tests and criterion-referenced tests. You've probably taken a few norm-referenced tests in school—the standardized tests taken in grade school and high school.

These tests measure a student's essential knowledge in core subjects, check whether they’re performing at their grade level, and measure their knowledge compared to other students.

We'll consider the difference between norm-referenced and criterion-referenced tests in detail later in this article. First, let's consider the types of criterion validity.


Types of criterion validity

Two types of criterion validity exist:

Predictive validity: models the likelihood of an outcome

Concurrent validity: confirms whether one measure is equal to or better than another accepted measure when testing the same thing at the same time

Predictive validity

A test that uses predictive validity aims to predict future performance, behavior, or outcome. Either the test administrator or the test taker can use the results to improve decision-making.

Real-world use of predictive validity tests

An employer might administer a predictive validity test to determine whether a person is likely to perform well in a specific job. To do this accurately, the employer must have a large data set of people who have already performed successfully in the job.

This example of employers administering a screening test is the most common application of predictive validity, but there are lots of other uses. For example, psychiatrists and psychologists might administer a psychological personality inventory to help diagnose a patient and gauge the possibility or probability of future behaviors.

General practice doctors (GPs) use risk factor surveys as a part of their new patient intake packets. The survey's results predict a patient's potential for developing a disease. For example, a doctor might counsel a patient who smokes two packs of cigarettes a day that their behavior increases their risk of developing lung cancer.

High school students who take the SAT are taking a predictive validity test. College students who take the GRE to gain admittance to graduate school are also taking a predictive validity test. The SAT and GRE both offer criterion validity because, over the long term, students' scores prove a valid predictor of their future academic performance as measured by their grade point average (GPA) if admitted to a college or grad school.

Concurrent validity

A test that uses concurrent validity tests the same criterion as another test. To ensure the accuracy of your test, you administer it as well as an already accepted test, which is scientifically proven to measure the same construct.

By comparing the results of both tests, you can determine whether the one you have developed accurately measures the variable you’re interested in. This type of criterion validity test is used in the fields of:

Social science

Real-world use of concurrent validity tests

If a psychologist developed a new, self-reported psychological test for measuring depression called the Winters Depression Inventory (WDI), they'd need to test its validity before using it in a clinical setting. They'd recruit non-patients to take both inventories—their new one and a commonly accepted, established one, such as the Beck Depression Inventory (BDI).

The sample group would take both inventories under controlled conditions. They would then compare the two test results for each member of the sample group. This process determines that the test they developed measures the same criterion at least as well as the accepted gold standard.

In statistical terminology, when the sample's WDI and BDI results match or are close to each other, the two sets of scores are said to have a high positive correlation. In this scenario, the psychologist has established the concurrent validity of the new inventory (the WDI) against the established one (the BDI).

How to measure criterion validity

That brings us to measuring criterion validity. In our example of the psychology inventories, we discussed determining whether the two inventories agreed when administered at the same time and how strongly their scores correlated, either positively or negatively.

To measure criterion validity, use an established metric. There are many options to choose from, including:

Pearson Correlation Coefficient

Spearman's Rank Correlations

Phi Correlations

Which correlation coefficient or method you use depends on:

Whether you're analyzing a linear or non-linear relationship

The number of variables in play

The distribution of your data
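As a rough illustration of the three coefficients listed above, the sketch below computes each one with the Python standard library only. The data sets, variable names, and helper functions (`pearson`, `ranks`, `spearman`, `phi`) are made up for this example and do not come from any real study.

```python
import math

def pearson(x, y):
    """Pearson product-moment correlation of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)

def ranks(values):
    """1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        average_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = average_rank
        i = j + 1
    return result

def spearman(x, y):
    """Spearman rank correlation: Pearson correlation of the ranks."""
    return pearson(ranks(x), ranks(y))

def phi(x, y):
    """Phi coefficient for two 0/1 variables (Pearson applied to binary data)."""
    return pearson(x, y)

# Hypothetical continuous data: admission test scores and later GPA.
test_scores = [52, 61, 58, 70, 75, 66, 80, 90]
gpa = [2.4, 2.9, 2.7, 3.1, 3.4, 3.0, 3.6, 3.9]

# Hypothetical binary data: passed a screening test, still employed after a year.
passed = [1, 1, 0, 0, 1, 0, 1, 0]
retained = [1, 1, 0, 0, 1, 1, 1, 0]

r = pearson(test_scores, gpa)
rho = spearman(test_scores, gpa)
phi_value = phi(passed, retained)
print(f"Pearson r = {r:.3f}, Spearman rho = {rho:.3f}, phi = {phi_value:.3f}")
```

Pearson suits roughly linear relationships between continuous scores, Spearman only assumes the relationship is monotonic, and phi applies when both variables are yes/no outcomes.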

Advantages of criterion validity

Criterion-referenced tests offer numerous advantages over norm-referenced tests when used to measure student or employee progress:

You can design the test questions to match (correlate to) specific program objectives.

A criterion-referenced test offers a clear picture of an individual's command of specific material.

You can create and manage criterion-referenced tests locally.

The local administrator, such as a doctor or teacher, can diagnose problems using the test results and work with the individual to improve their situation.

Disadvantages of criterion validity

Criterion-referenced tests also have some significant disadvantages:

Building these reliable and valid test instruments is expensive and time-consuming.

You can't generalize findings beyond the local application, so they don't work to measure large group performance across a broad set of locations.

The test takers could invalidate results by accessing the test questions before taking the test.

Applications of criterion validity

To use criterion validity, you need both a predictor variable and a criterion variable. Examples of the predictor variable include the GRE or SAT. For the criterion variable, you need a known, valid measure of the outcome you're interested in, such as GPA for academic performance.

In some areas of study, such as social sciences, the lack of relevant criterion variables makes it difficult to use criterion validity.

Does a criterion validity test reflect a certain set of abilities?

No, that describes a norm-referenced test, which reflects a particular set of abilities and ranks test takers against one another. Standardized tests such as the SAT (taken by high school students) or the LSAT and GRE (taken by applicants to law school and graduate school) report a score relative to other test takers in the same population. A population could be all sixth graders in a school, district, state, or nation.

A criterion-referenced test, such as a driving test, determines whether a person can drive a car safely and obey road rules to a set standard or criterion. It measures what the person knows or can do, independent of how others perform. Another example is a test at the end of a university semester, which is purely focused on how much a student knows about a certain topic. Students are not ranked; they either pass or fail.

The lowdown

One of the four types of validity tests, criterion validity tests the correlation of a variable to a concrete outcome. If you've sat the ASVAB, SAT, or GRE, you took a norm-referenced test, which is different from a criterion validity test. Those tests indicate to an organization the likelihood of success.

When you design a survey or test instrument, you choose valid measures and appropriate correlation coefficients. You would also test your assessment before using it in a clinical setting to ensure construct validity, content validity, and reliability.

What are correlation coefficients?

A correlation coefficient is a descriptive statistic that sums up the direction of the relationship between variables and the strength of the correlation. A correlation coefficient ranges from -1 to 1.

You can have a correlation coefficient of 0, which denotes no relationship between the variables.

What is test-retest reliability?

Also called stability reliability, test-retest reliability refers to the clinical and research practice of administering a test twice, with an interval of a few weeks or several months between the first test and the second. This approach provides evidence of the reliability of the test.

The test administrator compares the two test results to determine their correlation. A correlation of 0.80 or above can be evidence of the test's reliability.
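A minimal sketch of that comparison, assuming two hypothetical sets of scores from the same six people and treating the 0.80 figure as a rule of thumb instead of a fixed standard. The names `first_administration`, `second_administration`, and `RELIABILITY_THRESHOLD` are illustrative only.

```python
import math

def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)

# Hypothetical scores for the same six people, tested several weeks apart.
first_administration = [12, 18, 25, 31, 9, 22]
second_administration = [14, 17, 27, 29, 11, 21]

# Rule-of-thumb cutoff mentioned in the text; an assumption, not a universal rule.
RELIABILITY_THRESHOLD = 0.80

retest_r = pearson(first_administration, second_administration)
meets_threshold = retest_r >= RELIABILITY_THRESHOLD
print(f"test-retest r = {retest_r:.3f}, meets threshold: {meets_threshold}")
```

With real data, the interval between administrations matters: too short and people remember their answers, too long and the trait itself may have changed.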

What is construct validity?

Construct validity refers to whether a test or assessment accurately examines the construct the researcher is testing. Take the example in the article of the psychologist creating the fictional WDI and testing it alongside the existing BDI. Because both inventories are meant to measure depression, a high correlation between them also counts as evidence for the construct validity of the WDI.

When a measure correlates strongly with other measures of the same or closely related constructs, we say it shows convergent validity. When it shows little or no correlation with measures of unrelated constructs, we say it shows discriminant validity. Good construct validity requires both.

What is content validity?

Also referred to as logical validity, content validity refers to how comprehensively a test or assessment evaluates a construct, topic, or behavior. Determining content validity requires an expert in the field or topic that the assessment is testing.

What is a valid measure?

In statistical analyses, not all measures provide valid insight into a data set. Your chosen measure needs to correspond to the construct that your assessment is testing.

For example, educators have administered the SAT since 1926. In over 90 years of its administration, it has proven to be an accurate predictor of whether the test taker will fare well in college. We can say that the SAT offers a valid measure of collegiate academic success.


Statistics By Jim

Making statistics intuitive

Validity in Research and Psychology: Types & Examples

By Jim Frost

What is Validity in Psychology, Research, and Statistics?

Validity in research, statistics, psychology, and testing evaluates how well test scores reflect what they’re supposed to measure. Does the instrument measure what it claims to measure? Do the measurements reflect the underlying reality? Or do they quantify something else?


For example, does an intelligence test assess intelligence or another characteristic, such as education or the ability to recall facts?

Researchers need to consider whether they’re measuring what they think they’re measuring. Validity addresses the appropriateness of the data rather than whether measurements are repeatable (reliability). However, for a test to be valid, it must first be reliable (consistent).

Evaluating validity is crucial because it helps establish which tests to use and which to avoid. If researchers use the wrong instruments, their results can be meaningless!

Validity is usually less of a concern for tangible measurements like height and weight. You might have a cheap bathroom scale that tends to read too high or too low, but it still measures weight. For those types of measurements, you’re more interested in accuracy and precision. However, other types of measurements are not as straightforward.

Validity is often a more significant concern in psychology and the social sciences, where you measure intangible constructs such as self-esteem and positive outlook. If you’re assessing the psychological construct of conscientiousness, you need to ensure that the measurement instrument asks questions that evaluate this characteristic rather than, say, obedience.

Psychological assessments of unobservable latent constructs (e.g., intelligence, traits, abilities, and proclivities) raise a specific issue known as test validity, which is the extent to which theory and data support the interpretations of test scores. It is a critical issue because it determines how the test results can be understood.

Evaluating Validity

Researchers validate tests using different lines of evidence. An instrument can be strong for one type of validity but weaker for another. Consequently, it is not a black or white issue—it can have degrees.

In this vein, there are many different types of validity and ways of thinking about it. Let’s take a look at several of the more common types. Each kind is a line of evidence that can help support or refute a test’s overall validity. In this post, learn about face, content, criterion, discriminant, concurrent, predictive, and construct validity.

If you want to learn about experimental validity, read my post about internal and external validity. Those types relate to experimental design and methods.

Types of Validity

In this post, I cover the following seven types of validity:

Face Validity: On its face, does the instrument measure the intended characteristic?
Content Validity: Do the test items adequately evaluate the target topic?
Criterion Validity: Do measures correlate with other measures in a pattern that fits theory?
Discriminant Validity: Is there no correlation between measures that should not have a relationship?
Concurrent Validity: Do simultaneous measures of the same construct correlate?
Predictive Validity: Does the measure accurately predict outcomes?
Construct Validity: Does the instrument measure the correct attribute?

Let’s look at these types of validity in more detail!

Face Validity

Face validity is the simplest and weakest type. Does the measurement instrument appear “on its face” to measure the intended construct? For a survey that assesses thrill-seeking behavior, you’d expect it to include questions about seeking excitement, getting bored quickly, and risky behaviors. If the survey contains these questions, then “on its face,” it seems like the instrument measures the construct that the researchers intend.

While this is a low bar, it’s an important issue to consider. Never overlook the obvious. Ensure that you understand the nature of the instrument and how it assesses a construct. Look at the questions. After all, if a test can’t clear this fundamental requirement, the other types of validity are a moot point. However, when a measure satisfies face validity, understand it is an intuition or a hunch that it feels correct. It’s not a statistical assessment. If your instrument passes this low bar, you still have more validation work ahead of you.

Content Validity

Content validity is similar to face validity—but it’s a more rigorous form. The process often involves assessing individual questions on a test and asking experts whether each item appraises the characteristics that the instrument is designed to cover. This process compares the test against the researcher’s goals and the theoretical properties of the construct. Researchers systematically determine whether each question contributes, and that no aspect is overlooked.

For example, if researchers are designing a survey to measure the attitudes and activities of thrill-seekers, they need to determine whether the questions sufficiently cover both of those aspects.


Criterion Validity

Criterion validity relates to the relationships between the variables in your dataset. If your data are valid, you’d expect to observe a particular correlation pattern between the variables. Researchers typically assess criterion validity by correlating different types of data. For whatever you’re measuring, you expect it to have particular relationships with other variables.

For example, measures of anxiety should correlate positively with the number of negative thoughts. Anxiety scores might also correlate positively with depression and eating disorders. If we see this pattern of relationships, it supports criterion validity. Our measure for anxiety correlates with other variables as expected.

This type is also known as convergent validity because scores for different measures converge or correspond as theory suggests. You should observe high correlations (either positive or negative).


Discriminant Validity

This type is the opposite of criterion validity. If you have valid data, you expect particular pairs of variables to correlate positively or negatively. However, for other pairs of variables, you expect no relationship.

For example, if self-esteem and locus of control are not related in reality, their measures should not correlate. You should observe a low correlation between scores.

It is also known as divergent validity because it relates to how different constructs are differentiated. Low correlations (close to zero) indicate that the values of one variable do not relate to the values of the other variables—the measures distinguish between different constructs.

Concurrent Validity

Concurrent validity evaluates the degree to which a measure of a construct correlates with other simultaneous measures of that construct. For example, if you administer two different intelligence tests to the same group, there should be a strong, positive correlation between their scores.


Predictive Validity

Predictive validity evaluates how well a construct predicts an outcome. For example, standardized tests such as the SAT and ACT are intended to predict how high school students will perform in college. If these tests have high predictive ability, test scores will have a strong, positive correlation with college achievement. Testing this type of validity requires administering the assessment and then measuring the actual outcomes.


Construct Validity

A test with high construct validity correctly fits into the big picture with other constructs. Consequently, this type incorporates aspects of criterion, discriminant, concurrent, and predictive validity. A measure must correlate positively or negatively with the theoretically appropriate constructs, have no correlation with the constructs it should be unrelated to, correlate with other measures of the same construct, etc.

Construct validity combines the theoretical relationships between constructs with empirical relationships to see how closely they align. It evaluates the full range of characteristics for the construct you’re measuring and determines whether they all correlate correctly with other constructs, behaviors, and events.
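One way to picture this comparison of theoretical and empirical relationships is to write down the correlations theory predicts and check the observed ones against them. The sketch below is illustrative only: the scores are invented, and the cutoffs `CONVERGENT_MIN` and `DISCRIMINANT_MAX` are arbitrary assumptions, not accepted standards.

```python
import math

def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)

# Hypothetical scores for six respondents.
anxiety = [10, 14, 18, 22, 26, 30]
negative_thoughts = [3, 5, 6, 9, 10, 13]   # theory: should track anxiety
unrelated_trait = [4, 7, 5, 5, 7, 4]       # theory: should not track anxiety

# Illustrative cutoffs chosen for this sketch only.
CONVERGENT_MIN = 0.50
DISCRIMINANT_MAX = 0.20

r_convergent = pearson(anxiety, negative_thoughts)
r_discriminant = pearson(anxiety, unrelated_trait)

# The pattern fits theory when related measures correlate strongly
# and unrelated measures barely correlate at all.
pattern_ok = (abs(r_convergent) >= CONVERGENT_MIN
              and abs(r_discriminant) <= DISCRIMINANT_MAX)
print(f"convergent r = {r_convergent:.3f}, "
      f"discriminant r = {r_discriminant:.3f}, pattern fits: {pattern_ok}")
```

In practice researchers examine many such pairs at once, which is one reason factor analysis is commonly used for this purpose.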

As you can see, validity is a complex issue, particularly when you’re measuring abstract characteristics. To properly validate a test, you need to incorporate a wide range of subject-area knowledge and determine whether the measurements from your instrument fit in with the bigger picture! Researchers often use factor analysis to assess construct validity. Learn more about Factor Analysis .




Criterion Validity: Definition, Types of Validity


What is Criterion Validity?

A job applicant takes a performance test during the interview process. If this test accurately predicts how well the employee will perform on the job, the test is said to have criterion validity.
A graduate student takes the GRE. The GRE has been shown to be an effective tool (i.e., it has criterion validity) for predicting how well a student will perform in graduate studies.

The first measure (in the above examples, the job performance test and the GRE) is sometimes called the predictor variable or the estimator. The second measure is called the criterion variable as long as the measure is known to be a valid tool for predicting outcomes.

One major problem with criterion validity, especially when used in the social sciences, is that relevant criterion variables can be hard to come by.

Types of Criterion Validity

The three types are:

Predictive Validity: if the test accurately predicts what it is supposed to predict. For example, the SAT exhibits predictive validity for performance in college. It can also refer to when scores from the predictor measure are taken first and then the criterion data is collected later.
Concurrent Validity: when the predictor and criterion data are collected at the same time. It can also refer to when a test replaces another test (i.e. because it’s cheaper). For example, a written driver’s test replaces an in-person test with an instructor.
Postdictive Validity: if the test is a valid measure of something that happened before. For example, does a test for adult memories of childhood events work?


Criterion Validity

Reference work entry
First Online: 01 January 2020

Kai T. Horstmann, Max Knaut & Matthias Ziegler


Criterion validity refers to the degree to which a test score is related to a meaningful outcome or criterion of interest.

Introduction

In its most basic form, criterion validity refers to a construct’s or test score’s relatedness to a relevant outcome of interest, the criterion. Criterion validity is conceptualized as a between-person comparison and is an important component of the validation process of a test score. Without criterion validity, the application of a test score may not be useful.

Criterion validity can be examined with criteria assessed before (retrospective validity), at the same time as (concurrent validity), or after (predictive validity) obtaining the test score that is to be validated. Retrospective validity would be established if the test score was related to a previously assessed characteristic of a person, such as earlier scholastic performance. While this is still a common practice, the usefulness of this approach is at least questionable. Concurrent validity...



Author information

Authors and affiliations.

Humboldt-Universität zu Berlin, Berlin, Germany

Kai T. Horstmann, Max Knaut & Matthias Ziegler


Corresponding author

Correspondence to Matthias Ziegler .

Editor information

Editors and affiliations.

Oakland University, Rochester, MI, USA

Virgil Zeigler-Hill

Todd K. Shackelford

Section Editor information

Matthias Ziegler


Cite this entry.

Horstmann, K.T., Knaut, M., Ziegler, M. (2020). Criterion Validity. In: Zeigler-Hill, V., Shackelford, T.K. (eds) Encyclopedia of Personality and Individual Differences. Springer, Cham. https://doi.org/10.1007/978-3-319-24612-3_1293


DOI : https://doi.org/10.1007/978-3-319-24612-3_1293

Published : 22 April 2020

Publisher Name : Springer, Cham

Print ISBN : 978-3-319-24610-9

Online ISBN : 978-3-319-24612-3

eBook Packages : Behavioral Science and Psychology Reference Module Humanities and Social Sciences Reference Module Business, Economics and Social Sciences


The 4 Types of Validity | Types, Definitions & Examples

Published on 3 May 2022 by Fiona Middleton. Revised on 10 October 2022.

In quantitative research, you have to consider the reliability and validity of your methods and measurements.

Validity tells you how accurately a method measures something. If a method measures what it claims to measure, and the results closely correspond to real-world values, then it can be considered valid. There are four main types of validity:

Construct validity: Does the test measure the concept that it’s intended to measure?
Content validity: Is the test fully representative of what it aims to measure?
Face validity: Does the content of the test appear to be suitable to its aims?
Criterion validity: Do the results accurately measure the concrete outcome they are designed to measure?

Note that this article deals with types of test validity, which determine the accuracy of the actual components of a measure. If you are doing experimental research, you also need to consider internal and external validity, which deal with the experimental design and the generalisability of results.

Construct validity, content validity, face validity, criterion validity.

Construct validity evaluates whether a measurement tool really represents the thing we are interested in measuring. It’s central to establishing the overall validity of a method.

What is a construct?

A construct refers to a concept or characteristic that can’t be directly observed but can be measured by observing other indicators that are associated with it.

Constructs can be characteristics of individuals, such as intelligence, obesity, job satisfaction, or depression; they can also be broader concepts applied to organisations or social groups, such as gender equality, corporate social responsibility, or freedom of speech.

What is construct validity?

Construct validity is about ensuring that the method of measurement matches the construct you want to measure. If you develop a questionnaire to diagnose depression, you need to know: does the questionnaire really measure the construct of depression? Or is it actually measuring the respondent’s mood, self-esteem, or some other construct?

To achieve construct validity, you have to ensure that your indicators and measurements are carefully developed based on relevant existing knowledge. The questionnaire must include only relevant questions that measure known indicators of depression.

The other types of validity described below can all be considered as forms of evidence for construct validity.


Content validity assesses whether a test is representative of all aspects of the construct.

To produce valid results, the content of a test, survey, or measurement method must cover all relevant parts of the subject it aims to measure. If some aspects are missing from the measurement (or if irrelevant aspects are included), the validity is threatened.

Face validity considers how suitable the content of a test seems to be on the surface. It’s similar to content validity, but face validity is a more informal and subjective assessment.

As face validity is a subjective measure, it’s often considered the weakest form of validity. However, it can be useful in the initial stages of developing a method.

Criterion validity evaluates how well a test can predict a concrete outcome, or how well the results of your test approximate the results of another test.

What is a criterion variable?

A criterion variable is an established and effective measurement that is widely considered valid, sometimes referred to as a ‘gold standard’ measurement. Criterion variables can be very difficult to find.

What is criterion validity?

To evaluate criterion validity, you calculate the correlation between the results of your measurement and the results of the criterion measurement. If there is a high correlation, this gives a good indication that your test is measuring what it intends to measure.


Middleton, F. (2022, October 10). The 4 Types of Validity | Types, Definitions & Examples. Scribbr. Retrieved 16 September 2024, from https://www.scribbr.co.uk/research-methods/validity-types/


Validity – Types, Examples and Guide


Validity is a fundamental concept in research, referring to the extent to which a test, measurement, or study accurately reflects or assesses the specific concept that the researcher is attempting to measure. Ensuring validity is crucial as it determines the trustworthiness and credibility of the research findings.

Research Validity

Research validity pertains to the accuracy and truthfulness of the research. It examines whether the research truly measures what it claims to measure. Without validity, research results can be misleading or erroneous, leading to incorrect conclusions and potentially flawed applications.

How to Ensure Validity in Research

Ensuring validity in research involves several strategies:

Clear Operational Definitions: Define variables clearly and precisely.
Use of Reliable Instruments: Employ measurement tools that have been tested for reliability.
Pilot Testing: Conduct preliminary studies to refine the research design and instruments.
Triangulation: Use multiple methods or sources to cross-verify results.
Control Variables: Control extraneous variables that might influence the outcomes.

Types of Validity

Validity is categorized into several types, each addressing different aspects of measurement accuracy.

Internal Validity

Internal validity refers to the degree to which the results of a study can be attributed to the treatments or interventions rather than other factors. It is about ensuring that the study is free from confounding variables that could affect the outcome.

External Validity

External validity concerns the extent to which the research findings can be generalized to other settings, populations, or times. High external validity means the results are applicable beyond the specific context of the study.

Construct Validity

Construct validity evaluates whether a test or instrument measures the theoretical construct it is intended to measure. It involves ensuring that the test is truly assessing the concept it claims to represent.

Content Validity

Content validity examines whether a test covers the entire range of the concept being measured. It ensures that the test items represent all facets of the concept.

Criterion Validity

Criterion validity assesses how well one measure predicts an outcome based on another measure. It is divided into two types:

Predictive Validity: How well a test predicts future performance.
Concurrent Validity: How well a test correlates with a currently existing measure.

Face Validity

Face validity refers to the extent to which a test appears to measure what it is supposed to measure, based on superficial inspection. While it is the least scientific measure of validity, it is important for ensuring that stakeholders believe in the test’s relevance.

Importance of Validity

Validity is crucial because it directly affects the credibility of research findings. Valid results ensure that conclusions drawn from research are accurate and can be trusted. This, in turn, influences the decisions and policies based on the research.

Examples of Validity

Internal Validity: A randomized controlled trial (RCT) where the random assignment of participants helps eliminate biases.
External Validity: A study on educational interventions that can be applied to different schools across various regions.
Construct Validity: A psychological test that accurately measures depression levels.
Content Validity: An exam that covers all topics taught in a course.
Criterion Validity: A job performance test that predicts future job success.

Where to Write About Validity in a Thesis

In a thesis, the methodology section should include discussions about validity. Here, you explain how you ensured the validity of your research instruments and design. Additionally, you may discuss validity in the results section, interpreting how the validity of your measurements affects your findings.

Applications of Validity

Validity has wide applications across various fields:

Education: Ensuring assessments accurately measure student learning.
Psychology: Developing tests that correctly diagnose mental health conditions.
Market Research: Creating surveys that accurately capture consumer preferences.

Limitations of Validity

While ensuring validity is essential, it has its limitations:

Complexity: Achieving high validity can be complex and resource-intensive.
Context-Specific: Some validity types may not be universally applicable across all contexts.
Subjectivity: Certain types of validity, like face validity, involve subjective judgments.

By understanding and addressing these aspects of validity, researchers can enhance the quality and impact of their studies, leading to more reliable and actionable results.

About the author

Muhammad Hassan

Researcher, Academic Writer, Web developer


Open access
Published: 16 September 2024

Transcultural adaptation and validation of Persian Version of Patient Assessment of Chronic Illness Care (PACIC-5As) Questionnaire in Iranian older patients with type 2 diabetes

Sahar Maroufi 1 ,
Leila Dehghankar 2 , 3 ,
Ahad Alizadeh 4 ,
Mohammad Amerzadeh 5 &
Seyedeh Ameneh Motalebi 5

BMC Health Services Research volume 24, Article number: 1073 (2024)


The Patient Assessment of Chronic Illness Care (PACIC-5As) questionnaire has been designed to evaluate the healthcare experiences of individuals with chronic diseases such as diabetes. Older adults are at higher risk for diabetes and its associated complications. The aim of this study was transcultural adaptation and evaluation of the validity and reliability of the PACIC-5As questionnaire in older patients with diabetes residing in Qazvin City, Iran.

In this validation study, we recruited 306 older patients with diabetes from Comprehensive Health Centers in Qazvin, Iran. The multi-stage cluster sampling technique was used to choose a representative sample. The PACIC-5As questionnaire was translated into Persian using the World Health Organization (WHO) standardized method. The validity (face, content, and construct) and reliability (Cronbach’s alpha) of the PACIC-5As were assessed. Data analysis was conducted using R software and the Lavaan package.

The mean age of the older patients was 69.99 ± 6.94 years old. Most older participants were female ( n = 180, 58.82%) and married ( n = 216, 70.59%). Regarding face validity, all items of PACIC-5As had impact scores greater than 1.5. In terms of content validity, all items had a content validity ratio > 0.49 and a content validity index > 0.79. The results of confirmatory factor analysis demonstrated that the model exhibited satisfactory fit across the expected five factors, including assess, advise, agree, assist, and arrange, for the 25 items of the PACIC-5As questionnaire. The Cronbach’s alpha coefficient for the PACIC-5As questionnaire was 0.805.

This study indicates that the Persian version of the PACIC-5As questionnaire is valid and reliable for assessing healthcare experiences in older patients with diabetes. This means that the questionnaire can be effectively used in this population.


The global population is aging rapidly [ 1 ]. The rapid growth of the older population brings significant challenges, particularly relating to health [ 2 ]. Approximately 75% of individuals aged 60 and above are affected by at least one chronic disease, with nearly 50% experiencing two or more chronic conditions [ 3 ]. Diabetes mellitus is among the most common and preventable chronic diseases [ 4 ]. The prevalence of diabetes is highest among older adults [ 5 ]. Nearly half of all individuals with diabetes are older adults (aged 65 years or older) [ 6 ]. The older adult population is one of the fastest-growing segments of the diabetes population, and these numbers are projected to grow dramatically over the next few decades [ 7 ]. In Iran, Rashedi et al. (2017) found that approximately 14.4% of older adults have been diagnosed with type 2 diabetes [ 8 ].

Diabetes has emerged as one of the most serious and prevalent chronic diseases, posing a life-threatening risk, debilitating complications, substantial costs, and a reduction in life expectancy [ 9 ]. Older adults with diabetes are at serious risk of both micro and macrovascular complications [ 10 ]. Diabetes leads to an increased need for healthcare services, home care, hospitalization, and even residency in nursing homes [ 11 ]. Since self-care is a crucial aspect of diabetes management, patients need to adopt proper lifestyle habits and gain sufficient knowledge about the disease and its treatments [ 12 ].

Appropriate management of diabetes poses a significant challenge for individuals with the condition and healthcare providers [ 13 ]. Diabetes in older patients is a significant public health concern in the 21st century, and when combined with other health conditions, it can lead to exacerbated side effects [ 14 ]. Furthermore, older adults with diabetes require different types and qualities of care than other patient groups due to physiological, psychological, and social changes [ 15 ]. Iran is one of the most populous countries in the Middle East, where diabetes management is currently inadequate. Previous systematic reviews have shown that people aged over 55, especially women, tend to have poorer diabetes management [ 16 ]. A nationwide analysis of data for 30,202 patients revealed that only 13.2% of individuals with diabetes successfully attained satisfactory levels of glycemic control [ 17 ].

Managing diabetes in this group requires a comprehensive healthcare system that includes diagnosis, monitoring, and continuous medical treatment. The MacColl Institute for Healthcare Innovation developed the Chronic Care Model (CCM) to guide the delivery of healthcare services to patients with chronic conditions [ 18 ]. The CCM is a conceptual framework designed to bridge the gap between clinical research and real-life medical practice [ 19 ]. It focuses on providing proactive and planned care for chronic diseases rather than reactive and unplanned care. The CCM has six key dimensions: organization of health care, clinical information systems, delivery system design, decision support, self-management support, and community resources. It has been widely accepted for improving the care of chronically ill patients. Specifically, the “self-management support” aspect of the CCM helps patients enhance their confidence and skills to manage their illness better [ 20 ].

The Patient Assessment of Chronic Illness Care (PACIC) questionnaire, developed by Glasgow et al., is used to assess patient care for chronic diseases based on the CCM [ 21 ]. It is a self-report instrument that provides patients’ perspectives on receiving care for chronic diseases [ 22 ]. While there are several tools to measure patients’ experiences of chronic care [ 23 ], PACIC is one of the most suitable instruments to measure the chronic care management experiences of patients as it assesses the level of alignment with the CCM [ 24 , 25 ]. The initial version of the questionnaire comprises 20 items divided into five subscales: patient activation, decision-making support, goal setting, problem-solving, and follow-up [ 26 ]. Glasgow et al. [ 24 ] expanded the PACIC questionnaire by including six additional items to assess the recommended 5As model of chronic disease care in accordance with the guidelines of the United States Preventive Services Task Force [ 27 ]. The “5As” model is an evidence-based approach to behavior change employed to improve patients’ self-management. The primary elements of this model include assessing current behavior (assess), counseling the patient (advise), reaching a shared agreement on realistic goals (agree), assisting the patient throughout the lifestyle change (assist), and providing ongoing follow-up (arrange) [ 28 ]. The PACIC-5As questionnaire has been adapted into several languages, including Hindi [ 29 ], Danish [ 30 ], French [ 31 ], Korean [ 32 ], Thai [ 33 ], Bahasa Melayu [ 34 ], Arabic [ 35 ], German [ 36 ], and Spanish [ 37 ].

The incidence of diabetes has risen steadily in Iran, especially among older adults. Several reports consistently highlight that diabetic patients in Iran largely do not receive the necessary quality healthcare services [ 38 ]. Given the significant role of culture and context in perceiving diabetes care [ 39 ], there is a need for a questionnaire that has acceptable validity, reliability, and cultural appropriateness for older Iranian patients with diabetes. Currently, there is a lack of suitable tools to evaluate the quality of care for chronic diseases in older Iranians. The PACIC-5As is an invaluable tool for evaluating the structure of primary care and self-management support in diabetes from the patient’s viewpoint. Therefore, this study aimed to determine the validity and reliability of the PACIC-5As instrument in a sample of older patients with diabetes in Qazvin, Iran.

The objectives of this study are the following: (1) to translate and to assess the psychometric quality of the PACIC using the appropriate psychometric tests; (2) to determine the relationship between PACIC-5As scores and older patient characteristics and the quality of diabetes-specific care received.

This validation study was carried out on 306 older patients referred to the health centers in Qazvin, Iran. The multi-stage cluster sampling method was used to select the older patients. For this purpose, Qazvin City was divided into three urban zones. Then, nine Comprehensive Health Centers were randomly selected from the zones (three from the first, four from the second, and two from the third zone). In each center, a certain number of older patients with diabetes were randomly identified, and they were then called for the initial screening. During the phone call, potential participants were provided with an explanation of the study’s purpose and procedures. Older patients who met the inclusion criteria were invited to come, at a set time, to the health center nearest to where they lived. The questionnaires were completed through face-to-face interviews by the first author.

Inclusion criteria were being 60 years old or over, having a diagnosis of diabetes for at least six months (by a specialist physician), and willingness to participate in the study. Older patients with severe physical or mental problems that hindered their ability to communicate were excluded from the study.

In Confirmatory Factor Analysis (CFA) studies, one way to determine sample size is to use the Rule of Thumb, which recommends having at least ten participants for each item in the questionnaire. It also suggests that sample sizes larger than 300 participants are generally considered suitable for most structural models in confirmatory factor analysis [ 38 ]. As a result, 306 participants were chosen for this study.
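The rule of thumb above is simple arithmetic, sketched here for illustration. The helper name `min_sample_by_items` is hypothetical, and the item count of 25 is taken from the abstract's description of the PACIC-5As.

```python
def min_sample_by_items(n_items, per_item=10):
    """Minimum sample size under the ten-participants-per-item rule of thumb."""
    return n_items * per_item

n_items = 25      # PACIC-5As items, as stated in the abstract
recruited = 306   # sample size reported in the study

required = min_sample_by_items(n_items)
meets_rule = recruited >= required   # ten participants per item
above_300 = recruited > 300          # general guideline for CFA models

print(f"required >= {required}, recruited = {recruited}")
print(f"meets per-item rule: {meets_rule}, above 300: {above_300}")
```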

Instruments

Data were collected using a demographic and clinical characteristics checklist and the PACIC-5As questionnaire.

The socio-demographic and clinical characteristics checklist elicited information from older patients that included age, gender, marital status, education level, occupation, economic status, living arrangement, duration of diabetes, type of diabetes treatment, and diabetes-related complications.

PACIC-5As questionnaire: The PACIC-5As scoring uses 25 items in five domains: assess (items 1, 11, 15, 20, 21), advise (items 4, 6, 9, 19, 24), agree (items 2, 3, 7, 8, 25), assist (items 10, 12, 13, 14, 26), and arrange (items 16, 17, 18, 22, 23). The questionnaire items are rated on a 5-point Likert scale, ranging from “never” with a score of 1 to “always” with a score of 5, where a higher score indicates better patient assessment of chronic illness care [ 24 ].

Translation process

After correspondence with the questionnaire designer and obtaining permission, the translation process was conducted according to the protocols of the World Health Organization (WHO) [ 40 ] as follows:

Translation of the questionnaire into Persian

Two individuals who are proficient in both English and Persian independently translated the English version of the questionnaire into Persian. One of the translators had familiarity with medical sciences and their terminology. Ultimately, we obtained two independent Persian translations of the PACIC-5As.

In the second stage, the Persian translations of the questionnaire were reviewed and revised by the project team and colleagues. They took into account all options for word or phrase equivalency to prepare a unified Persian version of the questionnaire.

Retranslation into English

In this stage, two additional translators, fluent in both languages, translated the Persian version back into English. Then, the final English version was compared to the original questionnaire by two separate translators, and minor translation errors were found and fixed.

Validity of the questionnaire

The face, content, and structural validity of the questionnaire were examined.

Face validity assessment

For quantitative face validity, 10 older individuals with diabetes were asked to rate the importance of each item in the questionnaire on a 5-point Likert scale, ranging from 1 (not important at all) to 5 (very important). The impact scores were calculated using the following formula:

$$\text{Impact score} = \text{Frequency (\%)} \times \text{Importance}$$

The term “frequency” refers to the percentage of individuals who rated the item with scores of 4 or 5, while “importance” represents the average importance rating based on the Likert scale.

An item is considered acceptable in terms of face validity only if its impact score is higher than 1.5 [ 41 ]. The impact scores of all items of PACIC-5As were greater than 1.5 (Table 1 ).
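The impact score calculation can be sketched as follows. This is only an illustration: the function name and the ratings are hypothetical, and frequency is assumed to be entered as a proportion (0 to 1) rather than as a whole-number percentage, since that is the scale on which a cutoff of 1.5 is meaningful.

```python
def item_impact_score(ratings, threshold=1.5):
    """Return (impact score, acceptable?) for one item's 1-5 importance ratings.

    Assumption: frequency is the proportion of raters giving a 4 or 5.
    """
    n = len(ratings)
    frequency = sum(1 for r in ratings if r >= 4) / n
    importance = sum(ratings) / n          # mean importance rating
    score = frequency * importance
    return score, score > threshold

# Hypothetical ratings from 10 respondents for a single item
ratings = [5, 4, 4, 5, 3, 4, 5, 4, 2, 5]
score, acceptable = item_impact_score(ratings)
print(f"impact score = {score:.2f}, acceptable = {acceptable}")
```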

Content validity assessment

Both qualitative and quantitative methods were used to determine the validity of the content.

For qualitative assessment, the questionnaire was provided to 15 experts in geriatric medicine, geriatric nursing, nursing education, and endocrinology. They were asked to review the questionnaire based on criteria such as adherence to grammar rules, using appropriate terminology and proper phrases, and providing necessary feedback.

In the quantitative method for determining content validity, the content validity ratio (CVR) and content validity index (CVI) were calculated.

To determine the CVR, experts were initially requested to review each item based on a 3-point Likert scale (1. not essential, 2. useful but not essential, 3. essential). Then, the CVR was calculated using the following formula:

$$\mathrm{CVR} = \frac{n_e - N/2}{N/2}$$

In the above formula, $n_e$ represents the number of experts who rated the item as “essential,” and $N$ represents the total number of experts [ 42 ].

If the resulting value from the formula is greater than the critical value from the Lawshe table (which is 0.49, determined based on the evaluation of 15 experts), it indicates that the presence of the corresponding item is statistically significant ( p < 0.05) and necessary in this instrument [ 43 ].
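A minimal sketch of Lawshe's content validity ratio for a panel of 15 experts is shown below. The function name and the example counts are hypothetical; the critical value of 0.49 is the one quoted in the text.

```python
def content_validity_ratio(n_essential, n_experts):
    """Lawshe's CVR: (n_e - N/2) / (N/2)."""
    half = n_experts / 2
    return (n_essential - half) / half

CRITICAL_CVR_15_EXPERTS = 0.49  # Lawshe table value quoted in the text

# Hypothetical counts of experts (out of 15) rating an item "essential"
for n_essential in (15, 12, 11):
    cvr = content_validity_ratio(n_essential, 15)
    verdict = "retain" if cvr > CRITICAL_CVR_15_EXPERTS else "below critical value"
    print(f"{n_essential}/15 essential -> CVR = {cvr:.2f} ({verdict})")
```

Under these assumptions, at least 12 of the 15 experts must rate an item as essential for it to exceed the critical value.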

The CVI was used to determine the relevance of the questionnaire items. The questionnaire was given to 15 experts, who rated each statement on a 4-point Likert scale (1. not relevant at all, 2. somewhat relevant, 3. moderately relevant, 4. completely relevant). The CVI for each item (I-CVI) was then calculated as the number of experts who rated the item 3 or 4, divided by the total number of experts. According to the guidelines, items with scores above 0.79 were kept in the questionnaire. Items with scores between 0.70 and 0.79 were considered borderline and needed to be revised, and items scoring less than 0.70 were considered unacceptable and should be removed [ 44 ]. The CVR and I-CVI for the items of PACIC-5As are presented in Table 1 .
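The item-level CVI and the keep/revise/remove rule can be sketched as below, computing the index as the proportion of experts who give a rating of 3 or 4. Function names and ratings are hypothetical; the cutoffs are the ones given in the text.

```python
def item_cvi(ratings):
    """Proportion of experts rating the item 3 or 4 on the 4-point relevance scale."""
    return sum(1 for r in ratings if r >= 3) / len(ratings)

def classify_item(cvi):
    """Apply the cutoffs from the text: > 0.79 keep, 0.70 to 0.79 revise, < 0.70 remove."""
    if cvi > 0.79:
        return "keep"
    if cvi >= 0.70:
        return "revise"
    return "remove"

# Hypothetical ratings from 15 experts for three items
clear_item = [4] * 10 + [3] * 3 + [2] * 2       # 13 of 15 rate 3 or 4
borderline_item = [4] * 6 + [3] * 5 + [2] * 4   # 11 of 15
weak_item = [4] * 5 + [3] * 5 + [1] * 5         # 10 of 15

for name, ratings in [("clear", clear_item), ("borderline", borderline_item), ("weak", weak_item)]:
    cvi = item_cvi(ratings)
    print(f"{name}: I-CVI = {cvi:.2f} -> {classify_item(cvi)}")
```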

Construct validity assessment

Structural validity assessment.

To test the expected five-factor structure of the scale, CFA was performed using common goodness-of-fit indices in Structural Equation Modeling (SEM) with R software and the Lavaan package.

Reliability

To assess the reliability and internal consistency of the questionnaire, Cronbach’s alpha coefficient was used. An internal consistency above 0.70 is considered acceptable [ 45 ].
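For illustration, Cronbach's alpha can be computed from a respondents-by-items table with the standard formula α = k/(k − 1) × (1 − Σ item variances / variance of total scores). The sketch below uses only the Python standard library; the function name and the response data are hypothetical and are not the study's data.

```python
from statistics import variance

def cronbach_alpha(responses):
    """Cronbach's alpha for a list of rows (one row of item scores per respondent)."""
    k = len(responses[0])                      # number of items
    items = list(zip(*responses))              # transpose to one tuple per item
    item_var_sum = sum(variance(col) for col in items)
    total_var = variance([sum(row) for row in responses])
    return k / (k - 1) * (1 - item_var_sum / total_var)

# Hypothetical 5-point Likert responses: 5 respondents x 4 items
responses = [
    [4, 5, 4, 4],
    [3, 3, 2, 3],
    [5, 5, 4, 5],
    [2, 3, 2, 2],
    [4, 4, 3, 4],
]

alpha = cronbach_alpha(responses)
print(f"Cronbach's alpha = {alpha:.3f}, acceptable = {alpha > 0.70}")
```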

Ethical consideration

The study was approved by the Ethics Committee of Qazvin University of Medical Sciences, Qazvin, Iran (IR.QUMS.REC.1401.148). Participants were informed about the objectives, procedures, potential benefits, and drawbacks of the study. They were instructed that participation was voluntary and that the confidentiality of the obtained information was guaranteed. Informed consent was obtained from all respondents.

Data analysis

R software version 4.2.2 and the Lavaan package were used for data analysis. Descriptive statistics such as mean and standard deviation were used to describe quantitative variables, including age, duration of diabetes, and PACIC-5As scores. Frequency and percentage were used to describe qualitative variables, including gender, marital status, education level, occupation, economic status, living arrangement, type of diabetes treatment, and diabetes-related complications. The CFA model was estimated by maximum likelihood, and its fit was assessed against acceptable thresholds for the most common goodness-of-fit indices. The reliability and internal consistency of the questionnaire were assessed using Cronbach’s alpha coefficient. A multivariate regression model was used to determine the predictors of older patients’ perspective of diabetic care. Normal distribution of the data was confirmed by the skewness (0.078) and kurtosis (4.329) values, which were within the acceptable ranges: according to Byrne (2010) [ 46 ], normality can be assumed if the skewness value is between −2 and +2 and the kurtosis value is between −7 and +7. Multicollinearity was assessed using the variance inflation factor (VIF), and none of the variables had a VIF greater than 5. Homoscedasticity was also evaluated and confirmed. The statistical significance level was set at p < 0.05.
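The skewness and kurtosis screening rule can be sketched as follows. This is an illustration under stated assumptions: the text does not say which estimator the authors used, so the code uses simple moment-based skewness and excess kurtosis, and the function names and data are hypothetical.

```python
def skewness_kurtosis(values):
    """Moment-based skewness and excess kurtosis (assumed estimators)."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n
    m3 = sum((v - mean) ** 3 for v in values) / n
    m4 = sum((v - mean) ** 4 for v in values) / n
    return m3 / m2 ** 1.5, m4 / m2 ** 2 - 3

def within_byrne_ranges(values):
    """Check the ranges quoted in the text: skewness in [-2, 2], kurtosis in [-7, 7]."""
    skew, kurt = skewness_kurtosis(values)
    return -2 <= skew <= 2 and -7 <= kurt <= 7

symmetric = [1, 2, 3, 4, 5]       # hypothetical, symmetric scores
lopsided = [0] * 50 + [100]       # hypothetical, one extreme value

print(within_byrne_ranges(symmetric))
print(within_byrne_ranges(lopsided))
```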

The mean age of the participants was 69.99 ± 6.94 years, ranging from 60 to 96 years. The average duration of diabetes was 12.46 ± 5.9 years. Most older patients ( n = 230, 75.16%) were using insulin injections as part of their treatment regimen. The demographic and clinical characteristics of the older patients are presented in Table 2 .

Based on the information in Table 3 , the mean score of PACIC-5As was 3.47 ± 0.24. The highest level was in advise (4.13 ± 0.30) and the lowest level was in arrange (3.21 ± 0.30).

The multivariate regression model showed that level of education was a significant predictor of older patients’ perspective of diabetic care; older patients had a better perspective if they had a secondary and diploma (β = 0.11, p = 0.003) or academic (β = 0.23, p < 0.001) educational level (Table 4 ).

CFA results

Based on the information in Table 5 , the Structural Equation Modeling (SEM) analysis of the proposed model yielded a reasonable fit, with a Root Mean Square Error of Approximation (RMSEA) of 0.07 and a chi-squared/df ratio of 2.539, both indicating acceptable model fit [ 47 ]. The Goodness-of-Fit Index (GFI) of 0.85 and Adjusted Goodness-of-Fit Index (AGFI) of 0.81 suggest a moderately good fit, while the Incremental Fit Index (IFI) of 0.72 and Comparative Fit Index (CFI) of 0.71 fall below the recommended threshold of 0.90 for good fit. The Standardized Root Mean Square Residual (SRMR) of 0.08 sits at the upper bound of the acceptable range (0.08), which is consistent with a marginally acceptable fit. The structural model of the questionnaire is presented in Fig. 1 .
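To make the comparison against cutoffs explicit, the sketch below checks the reported indices one by one. Only the 0.90 threshold for IFI and CFI is stated in the text; the cutoffs used here for chi-squared/df (at most 3), RMSEA (at most 0.08), and SRMR (at most 0.08) are commonly cited conventions assumed for illustration, and the function name is hypothetical. GFI and AGFI are left out because the text gives no cutoff for them.

```python
# Fit indices reported in the text for the five-factor CFA model
reported = {"chi2/df": 2.539, "RMSEA": 0.07, "SRMR": 0.08, "IFI": 0.72, "CFI": 0.71}

# (direction, cutoff): "max" means the value should not exceed the cutoff,
# "min" means it should reach it. Only the 0.90 values come from the text;
# the other cutoffs are assumed conventions.
cutoffs = {
    "chi2/df": ("max", 3.0),
    "RMSEA": ("max", 0.08),
    "SRMR": ("max", 0.08),
    "IFI": ("min", 0.90),
    "CFI": ("min", 0.90),
}

def check_fit(values, rules):
    """Return {index name: True if the value meets its cutoff}."""
    result = {}
    for name, value in values.items():
        direction, cutoff = rules[name]
        result[name] = value <= cutoff if direction == "max" else value >= cutoff
    return result

fit = check_fit(reported, cutoffs)
for name, ok in fit.items():
    print(f"{name}: {reported[name]} -> {'meets' if ok else 'misses'} cutoff")
```

Laid out this way, the absolute fit indices meet their cutoffs while the incremental indices (IFI, CFI) do not, which is why the overall fit is described as reasonable rather than good.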

Fig. 1 The five-factor model for PACIC-5As obtained from confirmatory factor analysis

This study aimed to evaluate the psychometric properties of the Persian version of the PACIC-5As questionnaire in older adults with diabetes. The study also sought to establish the correlation between PACIC-5As scores and the characteristics of older patients, as well as the quality of diabetes-specific care received by older adults with diabetes in Qazvin, Iran. The study found that the mean scores on the PACIC-5As and its domains were higher than 3.0, which is similar to scores found in diabetes patients in the USA [ 22 ], Germany [ 48 ], and Switzerland [ 49 ]. This suggests that older Iranian patients receive a reasonable level of patient-centered care in accordance with the CCM.

The level of education was found to be a significant predictor of older patients’ perspective on diabetic care. Patients with higher education levels had higher PACIC-5As scores. Likewise, Rajabpour et al. (2019) [ 50 ] reported that patients with higher education had a greater perception of holistic care. Zakeri-moghadam and Sadeghi (2013) [ 51 ] also discovered a direct and significant relationship between patients’ education levels and their satisfaction with nursing services. This result may be attributed to the higher expectations of less educated patients from the healthcare staff and their care, and their lesser adaptation to their existing condition.

The CFA results confirmed the presence of five factors, which are assess, advise, agree, assist, and arrange, for the 25 items of the PACIC-5As questionnaire. Glasgow et al. (2005), the developers of the questionnaire, also confirmed the presence of these five factors based on the CCM for patients with one or multiple chronic diseases [ 22 ]. Similarly, Wensing et al. (2008) [ 25 ] confirmed the existence of these five factors in Dutch patients with diabetes or chronic obstructive pulmonary disease (COPD) for the PACIC or PACIC-5As questionnaire. Additionally, Shah et al. (2008) [ 37 ] verified the same for diabetic patients, and Rosemann et al. (2007) [ 20 ] for patients with osteoarthritis. However, Drewes et al. (2012) [ 52 ] could not replicate the five-factor structure in diabetic patients and long-term care patients. Similarly, Schwenke et al. (2019) [ 36 ] did not find the five-factor structure for the PACIC-5As questionnaire among obese patients. These contradictory results may be due to methodological differences and the specific characteristics of the study samples. The statistical methods used in the studies were quite different, and those that conducted exploratory factor analysis (EFA) used various methods to determine the number of factors. For instance, Drewes et al. (2012) [ 52 ] focused on patients with multiple chronic conditions and employed both EFA and CFA using the split-half method. They used three EFA approaches [principal axis factoring (PAF) with oblimin rotation, parallel analysis (PA), and Velicer’s minimum average partial (MAP) test] to explore the factor structure of the data. On the other hand, Schwenke et al. (2019) [ 36 ] examined the perspective of obese patients and used EFA with PA, the eigenvalue criterion, and a scree plot.

The reliability of the instrument in our study, as measured by Cronbach’s alpha, was in the acceptable range (Cronbach’s alpha = 0.805). In a study by Glasgow et al. (2005) [ 22 ], a Cronbach’s alpha of 0.96 for PACIC and 0.97 for PACIC-5As was reported in diabetic patients. Schwenke et al. (2019) [ 36 ] also reported a Cronbach’s alpha of 0.93 for PACIC and 0.94 for PACIC-5As in obese patients. Additionally, Alharbi et al. (2016) [ 35 ] found a Cronbach’s alpha of 0.93 for PACIC and 0.90 for PACIC-5As. The findings from previous studies demonstrated an optimal level of internal consistency. The variation in Cronbach’s alpha values in these studies may be attributed to the differences in the sample being investigated.

Limitations

Since this study was conducted on older patients with diabetes, the results may not be generalizable to older individuals with other chronic diseases. Another limitation is the response bias, including social desirability bias that seems relevant to the geriatric population. To mitigate this limitation, we provided necessary explanations regarding the study’s objectives and the application of the results to improve the care of patients with diabetes. Another limitation of the current research is the lack of test-retest reliability assessment due to limited access to the study’s older participants. It is recommended that future studies examine the temporal stability of the assessment tool.

The results of this study indicated that the PACIC-5As is a reliable and valid tool for measuring quality of care based on the key components of the CCM and patient motivation using the “5As” principles in older Iranian patients with diabetes. This tool can also evaluate how well physicians’ counseling reflects the 5As approach, which involves assessing, advising, agreeing, assisting, and arranging. Using this instrument can guide health professionals and policy-makers in Iran to improve the healthcare delivery system for older diabetic patients and enhance their satisfaction with the care they receive. In the present study, the 5As analysis showed that the scores for “Arrange” and “Assist” were lower than those for the other “As” among older patients. Practical and cost-effective interventions are therefore suggested to improve older patients’ perspective on diabetic care.

Availability of data and materials

All data generated or analyzed during this study will be available from the corresponding author on reasonable request.

Abbreviations

PACIC: Patient Assessment of Chronic Illness Care
WHO: World Health Organization
US: United States
CVR: Content Validity Ratio
CVI: Content Validity Index
CFA: Confirmatory Factor Analysis
SEM: Structural Equation Modeling
VIF: Variance Inflation Factor
SD: Standard Deviation
IQR: Interquartile Range
RMSEA: Root Mean Square Error of Approximation
GFI: Goodness-of-Fit Index
AGFI: Adjusted Goodness-of-Fit Index
IFI: Incremental Fit Index
CFI: Comparative Fit Index
SRMR: Standardized Root Mean Square Residual

Rashedi V, Asadi-Lari M, Foroughan M, Delbari A, Fadayevatan R. Prevalence of disability in Iranian older adults in Tehran, Iran: a population-based study. J Health Social Sci. 2016;1(3):251–62.

Motalebi SA, Zajkani Z, Mohammadi F, Habibi M, Mafi M, Ranjkesh F. Effect of acupressure on dynamic balance in elderly women: a randomized controlled trial. Exp Aging Res. 2020;46(5):433–45.

Hosseini SR, Moslehi A, Hamidian SM, Taghian SA. The relation between chronic diseases and disability in elderly of Amirkola. Salmand. 2014;9(2):80–7.

Naqshbandi M, Harris SB, Esler JG, Antwi-Nsiah F. Global complication rates of type 2 diabetes in indigenous peoples: a comprehensive review. Diabetes Res Clin Pract. 2008;82(1):1–17.

Das U, Kar N. Prevalence and risk factor of diabetes among the elderly people in West Bengal: evidence-based LASI 1st wave. BMC Endocr Disorders. 2023;23(1):170.

Bellary S, Kyrou I, Brown JE, Bailey CJ. Type 2 diabetes mellitus in older adults: clinical considerations and management. Nat Reviews Endocrinol. 2021;17(9):534–48.

Kalyani RR, Golden SH, Cefalu WT. Diabetes and aging: unique considerations and goals of care. Diabetes Care. 2017;40(4):440.

Rashedi V, Asadi-Lari M, Delbari A, Fadayevatan R, Borhaninejad V, Foroughan M. Prevalence of diabetes type 2 in older adults: findings from a large population-based survey in Tehran, Iran (Urban HEART-2). Diabetes Metabolic Syndrome: Clin Res Reviews. 2017;11:S347–50.

Sun H, Saeedi P, Karuranga S, Pinkepank M, Ogurtsova K, Duncan BB, et al. IDF Diabetes Atlas: Global, regional and country-level diabetes prevalence estimates for 2021 and projections for 2045. Diabetes Res Clin Pract. 2022;183:109119.

Zhang Y, Wieffer H, Modha R, Balar B, Pollack M, Krishnarajah G. The burden of hypoglycemia in type 2 diabetes: a systematic review of patient and economic perspectives. JCOM-Journal Clin Outcomes Manage. 2010;17(12):547.

Sinclair A, Morley JE, Rodriguez-Mañas L, Paolisso G, Bayer T, Zeyfang A, et al. Diabetes mellitus in older people: position statement on behalf of the International Association of Gerontology and Geriatrics (IAGG), the European Diabetes Working Party for older people (EDWPOP), and the International Task Force of experts in diabetes. J Am Med Dir Assoc. 2012;13(6):497–502.

Tessier DM, Lassmann-Vague V. Diabetes and education in the elderly. Diabetes Metab. 2007;33:S75–8.

Epakchipoor F, Bastani F, Pashaei SF. Self-management and medication adherence in older adults with type II diabetes referring to the endocrinology clinics of the teaching hospital affiliated to Iran University of Medical Sciences (2019). Iran J Nurs. 2021;34(129):1–14.

Roopa K, Rama Devi G. Quality of life of elderly diabetic and hypertensive people-impact of intervention programme. IOSR J Hum Soc Sci. 2014;19(3):67–73.

Araban M. Quality of midwifery care provided to women admitted for delivery in selected hospitals of Yazd. J Nurs Midwifery Quarterly-Shaheed Beheshti Univ Med Sci Health Serv. 2013;23(81):19–26.

Noshad S, Afarideh M, Heidari B, Mechanick JI, Esteghamati A. Diabetes care in Iran: where we stand and where we are headed. Annals Global Health. 2015;81(6):839–50.

Esteghamati A, Larijani B, Aghajani MH, Ghaemi F, Kermanchi J, Shahrami A, et al. Diabetes in Iran: prospective analysis from first nationwide diabetes report of National Program for Prevention and Control of Diabetes (NPPCD-2016). Sci Rep. 2017;7(1):13461.

Wagner EH, Austin BT, Davis C, Hindmarsh M, Schaefer J, Bonomi A. Improving chronic illness care: translating evidence into action. Health Aff. 2001;20(6):64–78.

Lenfant C. Clinical research to clinical practice—lost in translation? N Engl J Med. 2003;349(9):868–74.

Rosemann T, Laux G, Droesemeyer S, Gensichen J, Szecsenyi J. Evaluation of a culturally adapted German version of the Patient Assessment of chronic illness care (PACIC 5A) questionnaire in a sample of osteoarthritis patients. J Eval Clin Pract. 2007;13(5):806–13.

Von Korff M, Gruman J, Schaefer J, Curry SJ, Wagner EH. Collaborative management of chronic illness. American College of Physicians; 1997. pp. 1097–102.

Glasgow RE, Whitesides H, Nelson CC, King DK. Use of the Patient Assessment of Chronic Illness Care (PACIC) with diabetic patients: relationship to patient characteristics, receipt of care, and self-management. Diabetes Care. 2005;28(11):2655–61.

Vrijhoef HJ, Berbee R, Wagner EH, Steuten LM. Quality of integrated chronic care measured by patient survey: identification, selection and application of most appropriate instruments. Health Expect. 2009;12(4):417–29.

Glasgow RE, Wagner EH, Schaefer J, Mahoney LD, Reid RJ, Greene SM. Development and validation of the patient assessment of chronic illness care (PACIC). Med Care. 2005;43(5):436–44.

Wensing M, van Lieshout J, Jung HP, Hermsen J, Rosemann T. The patients Assessment Chronic Illness Care (PACIC) questionnaire in the Netherlands: a validation study in rural general practice. BMC Health Serv Res. 2008;8:1–6.

Chiu H-C, Hsieh H-M, Lin Y-C, Kuo S-J, Kao H-Y, Yeh S-CJ, et al. Patient assessment of diabetes care in a pay-for-performance program. Int J Qual Health Care. 2016;28(2):183–90.

Whitlock EP, Orleans CT, Pender N, Allan J. Evaluating primary care behavioral counseling interventions: an evidence-based approach. Am J Prev Med. 2002;22(4):267–84.

Vallis M, Piccinini–Vallis H, Sharma AM, Freedhoff Y. Modified 5 as: minimal intervention for obesity counseling in primary care. Can Fam Physician. 2013;59(1):27–31.

PubMed PubMed Central Google Scholar

Koley M, Saha S, Ghosh S, Nag G, Kundu M, Mondal R, et al. Patient-assessed chronic illness care (PACIC) scenario in an Indian homeopathic hospital. J Traditional Complement Med. 2016;6(1):72–7.

Maindal HT, Sokolowski I, Vedsted P. Adaptation, data quality and confirmatory factor analysis of the Danish version of the PACIC questionnaire. Eur J Public Health. 2012;22(1):31–6.

Krucien N, Le Vaillant M, Pelletier-Fleury N. Adaptation and validation of the patient assessment of chronic illness care in the French context. BMC Health Serv Res. 2014;14:1–12.

Rick J, Rowe K, Hann M, Sibbald B, Reeves D, Roland M, et al. Psychometric properties of the patient assessment of chronic illness care measure: acceptability, reliability and validity in United Kingdom patients with long-term conditions. BMC Health Serv Res. 2012;12:1–15.

Zeugfang D, Wisetborisut A, Angkurawaranon C, Aramrattana A, Wensing M, Szecsenyi J, et al. Translation and validation of the PACIC + questionnaire: the Thai version. BMC Fam Pract. 2018;19:1–10.

Abdul-Razak S, Ramli AS, Badlishah-Sham SF, Haniff J. Validity and reliability of the patient assessment on chronic illness care (PACIC) questionnaire: the malay version. BMC Fam Pract. 2018;19:1–11.

Alharbi NS, Alotaibi M, de Lusignan S. An analysis of health policies designed to control and prevent diabetes in Saudi Arabia. Glob J Health Sci. 2016;8(11):233.

Schwenke M, Welzel FD, Luck-Sikorski C, Pabst A, Kersting A, Blüher M, et al. Psychometric properties of the Patient Assessment of Chronic Illness Care measure (PACIC-5A) among patients with obesity. BMC Health Serv Res. 2019;19:1–12.

Shah NR, Aragones A, Schaefer EW, Stevens D, Gourevitch MN, Glasgow RE. Peer reviewed: validation of the Spanish translation of the Patient Assessment of Chronic Illness Care (PACIC) Survey. Prev Chronic Dis. 2008;5(4).

Mohseni M, Shams Ghoreishi T, Houshmandi S, Moosavi A, Azami-Aghdash S, Asgarlou Z. Challenges of managing diabetes in Iran: meta-synthesis of qualitative studies. BMC Health Serv Res. 2020;20:1–12.

Sachdeva S, Khalique N, Ansari MA, Khan Z, Mishra SK, Sharma G. Cultural determinants: addressing barriers to holistic diabetes care. J Social Health Diabetes. 2015;3(01):033–8.

World Health Organization. Process of translation and adaptation of instruments. 2016. http://www.who.int/substance_abuse/research_tools/translation/en/ .

Hajizadeh E, Asghari M. Statistical methods and analyses in health and biosciences: a methodological approach. Tehran, Iran: Iranian Student Book Agency; 2011.

DeVon HA, Block ME, Moyle-Wright P, Ernst DM, Hayden SJ, Lazzara DJ, et al. A psychometric toolbox for testing validity and reliability. J Nurs Scholarsh. 2007;39(2):155–64.

Lawshe C. A quantitative approach to content validity. Personnel Pschycology. 1975;28:563–75.

Yaghmaie F. Content validity and its estimation. J Med Educ. 2003;3(1).

Atkinson G, Nevill AM. Statistical methods for assessing measurement error (reliability) in variables relevant to sports medicine. Sports Med. 1998;26:217–38.

Byrne BM. Structural equation modeling with AMOS: basic concepts, applications, and programming (multivariate applications series). New York: Taylor & Francis Group. 2010;396(1):7384.

Rosemann T, Laux G, Szecsenyi J, Grol R. The Chronic Care Model: congruency and predictors among primary care patients with osteoarthritis. BMJ Qual Saf. 2008;17(6):442–6.

Frei A, Senn O, Huber F, Vecellio M, Steurer J, Woitzek K, et al. Congruency of diabetes care with the Chronic Care Model in different Swiss health care organisations from the patients’ perspective: a cross sectional study. Swiss Med Wkly. 2014;144(3536):w13992–w.

PubMed Google Scholar

Szecsenyi J, Rosemann T, Joos S, Peters-Klimm F, Miksch A. German diabetes disease management programs are appropriate for restructuring care according to the chronic care model: an evaluation with the patient assessment of chronic illness care instrument. Diabetes Care. 2008;31(6):1150–4.

Rajabpour S, Rayyani M, Mangolian shahrbabaki P. The relationship between Iranian patients’ perception of holistic care and satisfaction with nursing care. BMC Nurs. 2019;18:1–7.

Sadeghi S, Kazemnejad A. Effects of comprehensive-care program on patients’ satisfaction with trauma in emergency department. Iran J Cardiovasc Nurs. 2013;2(3):54–62.

Drewes HW, de Jong-van Til JT, Struijs JN, Baan CA, Tekle FB, Meijboom BR et al. Measuring chronic care management experience of patients with diabetes: PACIC and PACIC + validation. Int J Integr care. 2012;12.

Download references

Acknowledgements

This research is part of the SM thesis in geriatric nursing. The authors would like to express their sincere gratitude and appreciation to the Research Deputy of Qazvin University of Medical Sciences and all the older patients who participated in this study for their cooperation and contribution.

Author information

Authors and affiliations.

Student Research Committee, Qazvin University of Medical Sciences, Qazvin, Iran

Sahar Maroufi

Social Determinants of Health Research Center, Research Institute for Prevention of Non-Communicable Diseases, Qazvin University of Medical Sciences, Qazvin, Iran

Leila Dehghankar

Department of Nursing, Guilan University of Medical Sciences, Rasht, Iran

Medical Microbiology Research Center, Qazvin University of Medical Sciences, Qazvin, Iran

Ahad Alizadeh

Non-communicable Diseases Research Center, Research Institute for Prevention of Non-communicable Diseases, Qazvin University of Medical Sciences, Qazvin, Iran

Mohammad Amerzadeh & Seyedeh Ameneh Motalebi

Contributions

S.M, SA.M, and L.D conceived and designed the research method and helped to draft the manuscript. S.M collected the data. A.A performed the statistical analysis. S.M, SA.M, L.D, and M.A revised the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Seyedeh Ameneh Motalebi.

Ethics declarations

Ethics approval and consent to participate.

The present study has been approved by the Ethics Committee of Qazvin University of Medical Sciences (IR.QUMS.REC.1401.148). Prior to the study, a general overview of the objectives was provided to the older patients. All participants were assured that their information would remain confidential. Informed consent has been obtained from the participants, their parents and legally authorized representatives in this study.

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note.

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary material 1.

Supplementary material 2.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article.

Maroufi, S., Dehghankar, L., Alizadeh, A. et al. Transcultural adaptation and validation of Persian Version of Patient Assessment of Chronic Illness Care (PACIC-5As) Questionnaire in Iranian older patients with type 2 diabetes. BMC Health Serv Res 24, 1073 (2024). https://doi.org/10.1186/s12913-024-11557-0

Received: 07 January 2024

Accepted: 09 September 2024

Published: 16 September 2024

DOI: https://doi.org/10.1186/s12913-024-11557-0

Diabetes mellitus
Chronic disease
Health care
Psychometrics

BMC Health Services Research

ISSN: 1472-6963

What Is Predictive Validity? | Examples & Definition

Published on September 15, 2022 by Kassiani Nikolopoulou. Revised on June 22, 2023.

Predictive validity refers to the ability of a test or other measurement to predict a future outcome. Here, an outcome can be a behavior, performance, or even disease that occurs at some point in the future.

Predictive validity is a subtype of criterion validity. It is often used in education, psychology, and employee selection.

Table of contents

What is predictive validity?
Predictive validity example
Predictive vs. concurrent validity
How to measure predictive validity
Other interesting articles
Frequently asked questions

Predictive validity is demonstrated when a test can predict a future outcome. To establish this type of validity, the test must correlate with a variable that can only be assessed at some point in the future, i.e., after the test has been administered.

To assess predictive validity, researchers examine how the results of a test predict future performance. For example, SAT scores are considered predictive of student retention: students with higher SAT scores are more likely to return for their sophomore year. Here, you can see that the outcome is, by design, assessed at a point in the future.

A test score has predictive validity when it can predict an individual’s performance in a narrowly defined context, such as work, school, or a medical context.

For example, suppose you have designed a survey intended to predict employee retention. To establish its predictive validity, you ask all recently hired individuals to complete the questionnaire. One year later, you check how many of them stayed.
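The one-year check described above can be sketched in a few lines of Python. The scores, outcomes, and the cutoff of 65 are invented for illustration and are assumptions, not data from any real study. The sketch only compares how many high and low scorers stayed.

```python
# Hypothetical data: (survey score at hiring, still employed one year later).
new_hires = [
    (82, True), (75, True), (91, True), (68, True),
    (55, False), (60, True), (47, False), (52, False),
]


def retention_rate(group):
    """Share of a group that was still employed after one year."""
    return sum(stayed for _, stayed in group) / len(group)


cutoff = 65  # assumed threshold separating "high" from "low" scorers
high = [hire for hire in new_hires if hire[0] >= cutoff]
low = [hire for hire in new_hires if hire[0] < cutoff]

high_rate = retention_rate(high)
low_rate = retention_rate(low)

print(f"High scorers retained: {high_rate:.0%}")
print(f"Low scorers retained:  {low_rate:.0%}")
```

If the survey had predictive validity, you would expect a clearly higher retention rate among high scorers, as in this made-up sample. A real study would need far more respondents and a formal measure of association.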

Tests aimed at screening job candidates, prospective students, or individuals at risk of a specific health issue often are designed with predictive validity in mind.

Predictive and concurrent validity are both subtypes of criterion validity. They both refer to validation strategies in which the predictive ability of a test is evaluated by comparing it against a certain criterion or “gold standard.” Here, the criterion is a well-established measurement method that accurately measures the construct being studied.

The main difference between predictive validity and concurrent validity is the time at which the two measures are administered.

In predictive validity, the criterion variables are measured after the scores of the test.
In concurrent validity, the scores of a test and the criterion variables are obtained at the same time.

Predictive validity is measured by comparing a test’s score against the score of an accepted instrument—i.e., the criterion or “gold standard.”

The measure to be validated should be correlated with the criterion variable. Correlation between the scores of the test and the criterion variable is calculated using a correlation coefficient, such as Pearson’s r. A correlation coefficient expresses the strength of the relationship between two variables in a single value between −1 and +1.

Correlation coefficient values can be interpreted as follows:

r = 1: There is perfect positive correlation.
r = 0: There is no correlation at all.
r = −1: There is perfect negative correlation.

You can automatically calculate Pearson’s r in Excel, R, SPSS, or other statistical software.
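To make the calculation concrete, here is a minimal Python sketch of Pearson’s r computed from its standard definition (the covariance of the two variables divided by the product of their spreads). The function name `pearson_r` and the sample test scores and criterion values are hypothetical and chosen only for illustration.

```python
from math import sqrt


def pearson_r(x, y):
    """Pearson correlation between two equally long lists of numbers."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    # Sum of cross-products and sums of squared deviations from the means.
    cross = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    ss_x = sum((a - mean_x) ** 2 for a in x)
    ss_y = sum((b - mean_y) ** 2 for b in y)
    if ss_x == 0 or ss_y == 0:
        raise ValueError("r is undefined when a variable has no variation")
    return cross / sqrt(ss_x * ss_y)


# Hypothetical data: test scores now, criterion measured later.
test_scores = [52, 61, 70, 74, 85, 90]
criterion = [2.1, 2.6, 2.9, 3.0, 3.6, 3.7]

r = pearson_r(test_scores, criterion)
print(f"r = {r:.2f}")
```

In practice you would normally rely on a tested library routine rather than hand-written code, for example `scipy.stats.pearsonr` in Python, which also reports a p-value.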

A strong positive correlation provides evidence of predictive validity. In other words, it indicates that a test can correctly predict what you hypothesize it should. However, the presence of a correlation doesn’t mean causation, and if your gold standard shows any signs of research bias, it will affect your predictive validity as well.

The higher the correlation between a test and the criterion, the higher the predictive validity of the test. No correlation or a negative correlation indicates that the test has poor predictive validity.

Criterion validity evaluates how well a test measures the outcome it was designed to measure. An outcome can be, for example, the onset of a disease.

Criterion validity consists of two subtypes depending on the time at which the two measures (the criterion and your test) are obtained:

Concurrent validity is a validation strategy where the scores of a test and the criterion are obtained at the same time.
Predictive validity is a validation strategy where the criterion variables are measured after the scores of the test.

Reliability and validity are both about how well a method measures something:

Reliability refers to the consistency of a measure (whether the results can be reproduced under the same conditions).
Validity refers to the accuracy of a measure (whether the results really do represent what they are supposed to measure).

If you are doing experimental research, you also have to consider the internal and external validity of your experiment.

Validity tells you how accurately a method measures what it was designed to measure. There are four main types of validity:

Construct validity: Does the test measure the construct it was designed to measure?
Face validity: Does the test appear to be suitable for its objectives?
Content validity: Does the test cover all relevant parts of the construct it aims to measure?
Criterion validity: Do the results accurately measure the concrete outcome they are designed to measure?

Cite this Scribbr article

Nikolopoulou, K. (2023, June 22). What Is Predictive Validity? | Examples & Definition. Scribbr. Retrieved September 16, 2024, from https://www.scribbr.com/methodology/predictive-validity/
